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1. Introduction 

The microcanonical ensemble is considered to be the fundamental ensemble of statistical 
physics. For example, the use of the canonical Gibbs ensemble is usually justified 
by showing that its expectation values coincide with the microcanonical ones in the 
thermodynamical limit, when the size of the system approaches infinity. However, 
since an application of the microcanonical ensemble requires detailed knowledge about 
the energy levels of the system, it is seldom possible to use it in practice. The 
canonical ensemble, on the other hand, has a well-behaving path-integral expression 
easily extended to gauge field theories — therefore it has become the standard ensemble 
of quantum statistics. 

Nevertheless, there are situations where the easiest, canonical, alternative does not 
work. For instance, direct applications of the grand canonical ensemble have not been 
able to reproduce all the results of relativistic ion collision experiments [0J . There are two 
possible explanations for this shortcoming of the canonical ensemble: either the particle 
gas created in the collision does not reach thermal equilibrium before exploding into the 
final state hadrons, or the system is too small to be handled by canonical methods. In 
fact, recent calculations || using a modified grand canonical ensemble have succeeded 
in describing most of the particle abundances in these experiments, but only at the cost 
of including finite-volume corrections to the usual ensemble. This suggests that at least 
the final state hadron gas will thermalize, but it also shows that finite-volume effects 
are prominent in these systems. 

These results point into the direction that the microcanonical ensemble — which is 
optimal for describing isolated ergodic systems with small quantum numbers — should 
be used for getting quantitative information about the properties of the quark-gluon 
plasma possibly created in relativistic hadron collisions. There are already proposals 
how this can be accomplished in the continuum path-integral formulation of field theories 
@, but since the argumentation in this kind of formalism cannot be made completely 
mathematically rigorous, a further study of the details of the quantum microcanonical 
ensemble was felt to be necessary. 

When trying to do rigorous quantum microcanonical computations, one 
immediately encounters two practical difficulties associated with the discrete spectrum 
of an isolated quantum system: since the spectrum is discrete, the possible values of the 
spectrum have to be known in advance before any computations can be done; on the 
other hand, the position of the high energy spectral levels depend on small fluctuations 
of the interaction potential. Consider, for example, the harmonic oscillator, for which 
the energy levels are E n = u(n + §): for n = 100 one percent change in the oscillator 
frequency u will change the position of the level i^ioo by a whole energy unit! Therefore, 
the spectral levels by themselves are not very practical parameters; however, the density 
of the spectral levels is robust in fluctuations of the potential and thus a smoothened 
energy spectrum would offer a more stable alternative. 

This paper attempts to fill the gap between the canonical and the microcanonical 
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ensemble in quantum mechanics. The main ingredient in this is the introduction of the 
Gaussian ensemble, which is essentially just a regularization of the discrete spectrum of 
a closed quantum system. The Gaussian ensemble will be defined in section two, where 
we will also show how all microcanonical results can be obtained from the Gaussian 
ensemble in the limit where the regularization is removed. In the following sections 
we will then show that in a certain range of the regularization parameter the canonical 
ensemble is a good approximation of the Gaussian ensemble and we will propose definite 
ways of estimating the difference between the two methods. These results are then 
employed in the derivation of an asymptotic expansion of microcanonical corrections to 
the canonical ensemble, which will be most useful for systems near, but not quite at, the 
thermodynamical limit. Finally, we will give some implications of the present results 
to the thermodynamics of systems with exponentially increasing density of states, for 
which the canonical ensemble is in principle not well-defined. 

We will not repeat standard results or definitions of statistics of quantum systems 
in the thermodynamical limit here. The physical argumentation leading to the density 
operator formulation is explained in most textbooks on the subject, volume five of the 
classical series by Landau and Lifshitz |4j] being a good example. More elaborate and 
recent analysis of the subject can be found from volume one of the series by Balian ||, 
while a mathematically rigorous approach is developed in volume four of "A Course in 
Mathematical Physics" by Thirring |J. 

2. Gaussian ensemble|| as a regularization of the microcanonical ensemble 

Consider the operator defined by 



where H is the Hamiltonian and E and e / are two real parameters. Since H is 
self-adjoint, p e (E) is bounded, self-adjoint and positive. For this operator to define a 
sensible statistical ensemble via the trace formulas, it is also necessary to require that 
the spectrum of the Hamiltonian is discrete and increases sufficiently fast at infinity so 
that Tr f> e {E) < oo. If this is true, we will define the Gaussian expectation values of an 
observable A by the formula 



At this point, it will be useful to define some terminology to be used later. If 
the Hamiltonian of the system satisfies Tr p" £ (E) < oo for all E e M and e > 0, we 
will call the system Gaussianly bounded and define the above trace as the Gaussian 

\ On physical grounds, the idea of using a Gaussian energy distribution to define a statistical ensemble 
seems natural. Such a physical reasoning was adopted, for instance, in [Q to introduce a Gaussian 
ensemble for studying first order phase transitions in certain lattice gauge theories. 




(1) 




(2) 
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partition function Z^ ss . Similarly, if Tre - ^ < oo for all (3 > 0, we will say that the 
system is canonically bounded and the trace will give the canonical partition function 
Z c ^ n . The terms Gaussian and canonical observable, respectively, are then used for 
those normal or bounded operators (i.e. observables) A for which (|t4|)^ ss < oo or 
(|A|)^ an < oo for all the above values of the parameters — here \A\ refers to the positive 
square root of A^A. Analogously, systems with a discrete energy spectrum with finite 
multiplicities are called microcanonically bounded and all normal operators having a 
domain which contains the domain of the Hamiltonian as well as all bounded operators 
are microcanonical observables. In this last case, the microcanonical partition function 
^micro j g Qigfingo! by ^he number of eigenstates with energy E and the microcanonical 
expectation values are the averages over these eigenstates. 

Clearly, every canonically bounded system is Gaussianly bounded and every 
canonical observable is a Gaussian observable, and the same relations hold between 
the Gaussian and the microcanonical ensemble as well. The term bounded used here 
has also a direct physical interpretation in the canonical case: by the Golden-Thompson- 
Symanzik inequality ||, which applies for n-dimensional systems with Hamiltonians of 
the form H = |p 2 + V(x), a system is canonically bounded if the potential increases 
fast enough at infinity so that J ^xe"^'^ < oo for all (5 > 0. 

The formal relation lim^o Pe{E) — $(H — E) is one obvious motivation for using 
this ensemble to approximate the microcanonical one. However, it can also be argued 
that the Gaussian ensemble is even better suited for describing typical experimental 
situations than the microcanonical ensemble: if a quantum system was initially prepared 
into, or was measured to have, an energy E and the system is almost, but not completely, 
isolated having interactions with the environment that lead to energy fluctuations of the 
order of e, then the Gaussian ensemble using these parameters is the most natural way 
to predict the behaviour of a statistical average over many independent measurements 
of an observable. Of course, for this physical interpretation to be valid, the interactions 
with the environment need to be balanced in such a way as not to lead to a net flow 
of energy from one direction to the other — this is the essence of the requirement of 
"thermalization" of the system in the context of the Gaussian ensemble. 

Mathematically, the Gaussian ensemble is a regularization of the discrete energy 
spectrum by a convolution with the normal distribution. As was explained in the 
introduction, this is beneficial since it offers a way of removing the effect of the instability 
of the high energy spectral levels. Also, its use does not require any prior knowledge 
about the spectrum as it is well defined for all values of E. Most important is, however, 
the way how the Gaussian ensemble offers a natural and mathematically rigorous 
approximation of the microcanonical ensemble: 

(i) lim £ ^o V2ne*Z e ™ ss = Zf cro for all real E. 

(ii) lim e _^o Z^ ss = Tr S(H — E) as distributions in E. 

(iii) \im £ ^ (A)^ ss — (A)™, cm , where E' is the energy eigenvalue nearest to E. 
We will now conclude this section by proving these three statements. 
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Suppose that the system is Gaussianly bounded with discrete energy levels E n , 
each one having a multiplicity K n and eigenvectors f) n k = 1, . . . , n n . Then for any 
Gaussian observable A, the trace in (0) can be expressed as 



n,k 



(Qn,k\M^n,k) ■ (3) 



Denote {Vt n ^\A\Vt n ^) by a n ^ and note that since |a n) fc| < (fi n ,fc| |^4| |^n,fc)j the above 
series converges absolutely for any e > 0. 

Let us first look at the behaviour of the Gaussian partition function, i.e. use A = 1. 
By (D then 



V^Z|7 s = ^ Kn exp 



Since for any W > the function e _M/ / £ is an increasing function of e in the region 
e > 0, dominated convergence can be invoked to move the limit e — > inside the sum, 
which then gives the result 

lim y27T£2Z gauss = I K «' if£ = - E ™ for some n 
e^o E > £ |0, otherwise 

Since the right hand side equals Z™ lcro by definition, this proves the first statement. 

Let next f(E) be any smooth function with a compact support and let M > be 
such that |supp/| < M. To prove the second statement, we need to show that 



/oo 
dEf(E)Z^ ss = J2^nf(E n ). 
■oo „ 



Since J'^dE f(E) -?=p exp [— (-^n — -E) 2 ] — ► f(E n ), when e — >■ 0, this is clearly true 
if only it were possible to move first the ^-integration and then the e — > limit inside 
the sum. But, in fact, both of these operations are now allowed by the dominated 
convergence theorem, since we have the bounds 

, , ..expT-A (E n -E) 2 ] 
dE\f(E)\ PL ^IJL LI 

oo \l 

„ if |£ n | < 2M 

,exp[-± (|£ n | - M) 2 + iM 2 ] , if \E n \ > 2M ' 



< 



for all < e < 1. 

Let us finally evaluate the limit of the expectation values for general A, 

«sr = e exp[ :^ ( ^ )2] ^^i^)- (4) 

For this we will need the result 

1 e ^o I 0, if \E n r — E\ < \E n — E\ for some n' 

(En' Kn'S^-EUEn-E])' 1 , otherwise 



E ex P 

n'k' 



(E n ,-E)*-(E n -E)* 
2e 2 



(5) 
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In other words, this limit is zero if E n is not the eigenvalue nearest to E, it is 
+ K m )~ l if both E n and E m are nearest eigenvalues (i.e. if E lies exactly in the 
middle of the segment joining E n and E m and no other eigenvalues are on this segment) 
and it is k" 1 if E n is a unique nearest eigenvalue. Let us use notation M n (E) for the 

SUm Yju> K n>8\E n ,-E\,\E n -E\- 

Let no be the index of any one of the eigenvalues nearest to E. Then for all n 



exp [-^ {E n - Ef] exp [-^ (E n - E) 2 ] 1 

< exp 



Tr p £ V2ire 2 £ Kn , eX p 



2 £ 2 



(E no — E) 2 — (E n — Ey 

2^2 



,(6) 



where {E no — E) 2 — (E n — E) 2 < 0. This implies that, for all e in the range 
< e < 1, the absolute value of each of the terms of the series in (|4]) is less than 
exp[(E no — E) 2 /2]exp[—(E n — E) 2 /2]\a n! k\, which again form an e-independent sum 
that is convergent by assumption. Thus dominated convergence can be applied to move 
the limit inside the sum, which then by equation (|5|) yields the result 

(A)eT ^ M ^5\E n -E\,\E no -E\(tt n> k\A\n n>h ). 

Since M no (E) is the number of non-zero terms in the above sum, the final expression 
is nothing but the average of the expectation values of A over the energy eigenstates 
nearest to E. 

Therefore, whenever E coincides with a point in the spectrum, the limit e —>■ 
will give the microcanonical expectation value. On the other hand, if E does not 
belong to the energy spectrum (in which case the microcanonical ensemble is in principle 
ill-defined), then the microcanonical result corresponding to the nearest eigenvalue is 
obtained. The only values of E giving non-microcanonical limits are those lying exactly 
in the middle between two eigenvalues, but even then the result is an expectation value 
of a uniform distribution over two energy eigenvalues. 



3. Canonical ensemble as an approximation to the Gaussian ensemble 

We will next show how the canonical ensemble can be used for approximating the 
Gaussian ensemble in the region where the energy resolution e is sufficiently large. For 
this we need to assume that the system is canonically bounded and that A is a canonical 
observable. Since this, in particular, requires the energy spectrum to be bounded from 
below, we will also assume that the Hamiltonian has been normalized so that the lowest 
energy level E is non-negative. 

Let us first assume that (3 is a positive parameter. Since 

-^(E - E n ) 2 = —-^—(E -E n + (3e 2 ) 2 + (3E - (3E n + \(3 2 e 2 , 

we have the identity 

Tr(lp £ (£)) =e^ 2£2+ ? E Tr(Ap £ (E + (3e 2 )e-^ . (7) 
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If we integrate this multiplied by e @ E over E and take the integration inside the trace, 
which is possible since A is a canonical observable, we get the exact formula 

Ti^Ae-^ = e -s /jV J°° dE e^ E Tr (Ap £ {E)^ (8) 

valid for all e > and (3 > 0. This is a regularized form of the familiar statement that 
the canonical ensemble is the Laplace transform of the microcanonical one. 

The result (|3|) has a more interesting inverse formula, which we will derive next. 
We will begin with the Fourier-transform of the normal distribution, 

which can be applied to ([7|), yielding 

In changing the order of the integration and the trace we again had to use the assumption 
that A is a canonical observable. Using now a new integration variable w = (3 + ia we 
get a particularly simple form of the desired inversion formula for (§), 

Ti(Ap £ (E))= —ey* w2+wE Tr(Ae- wS ) (10) 

V ' J/3-ioo 2m V ' 

valid for all (3 > 0. We can now conclude that the analytical form of the canonical trace 
contains all the information needed to compute the microcanonical expectation values, 
which are then easily extracted from the integral in (10). However, this formula leads 
also to a simple relation between the usual real-temperature canonical ensemble and the 
Gaussian ensemble which we shall inspect next. 

For all canonical observables, the trace Tr (^Ae~ wH j is obviously an analytic function 
of w in the half-plane Re w > and all its derivatives are given by a differentiation inside 
the trace, i.e. 

Ah ~ 

— Tr^Ae-™^ = Ti^A(-H) k e~ wS ^j . 
Therefore, saddle point methods can be used in evaluation of the integral in (|T0|) , 



/3+ioo dw 



27i 6XP 



-e 2 w 2 + wE + \nTr{ A( -"" 



i f3— ioo 

Here the branch of the logarithm needs to be chosen so that the logarithm is analytic 
on the integration contour; if the contour happens to go through a zero of the trace, 
then an infinitesimal deformation of the contour is necessary — note that this is always 
possible if we only make the trivial assumption A ^ 0. The saddle point equation, which 
is of course independent of what branch we use for the logarithm, is 



Tr(AHe- wS 
Tr (Ae~ w ** 



E + e 2 w, (11) 
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while the second derivative, which will determine the direction of the the steepest descent 
path, is 



Tr (^AH 2 e- w ^ / Tr (AH e~ wS \ \ 2 



Tr^Ae-^) V Tr^le-^) 



+ e 2 . (12) 



So far we have assumed the observable A only to be a canonical one. For 
the following discussion we shall also assume that A is positive, non-zero and that 
Tr yAIlj = 00, keeping in mind that the special case A = 1 falls into this category. We 

shall also use the notation O for Tr (^AOe~ w ^ / Tr (^Ae~ w ^ j . With these definitions, the 

saddle point equation becomes H = E + e 2 w and the second derivative is (H — H) 2 + e 2 . 
If we restrict ourselves to the positive real axis, then Tr(^Ae~ w ^ will always be 



strictly positive and both H and (H — H) 2 will be well-defined and positive. From 
this we conclude that on the positive real axis the logarithm hiTr^Ae - ™^ is a convex 

function and that the expectation value H is strictly decreasing. Therefore, the saddle 
point equation fllTD has at most one positive solution for each E. On the other hand, 
the assumption Tr {^AHj = 00 can be used for showing that a solution exists for every 
£Gl and e > 0. We shall now assume that /3, which was an arbitrary parameter of the 
integration contour in ( |IUD, has bee n chosen to equal this unique positive solution. Since 
the second derivative is (H — H) 2 + e 2 , it is now strictly positive, and the integration 
contour (3 — ioo — > (3 + ioo in fact goes through the saddle point (3 via the path of 
steepest descent. Using the saddle point approximation to evaluate the contribution of 
this saddle point to the integral gives then 

From this, equation ([/]) and the saddle point equation we obtain the following exact 
result and its saddle point approximation 

Tr (Ap e (Ej) =e-^ 2e2+ ^Ti[Ap e {H)e~^ (13) 
_L( e 2 + ^71p) 4 e 4^ 2 +^Tr(le^) . (14) 



27T 

So far we have only inspected the saddle points on the positive real axis. However, 
as a simple example with e.g. a harmonic oscillator will show, there will generally be a 
countably infinite set of saddle points on the complex plane. Also, it is quite possible 
that the steepest descent path going through all the relevant saddle points will not stay 
on the right half-plane, which will be unfortunate unless the analytical continuation of 
the canonical trace over the imaginary axis is known. In addition, the values of (3 at 
which the traces Tr^Ap e (i?)j and Tr p £ (E) need to be evaluated are typically different, 
which would then mean that the ratio of their saddle point approximations does not 
exactly equal the canonical expectation value. 



Microcanonical improvement of the quantum canonical ensemble 



9 



However, as we shall see in the next section, in the thermodynamical limit these 
considerations are not relevant and the saddle point approximation using the positive (3 
will give accurate results so long as we use a suitable e and inspect only large enough 
energies E. For the partition function this saddle point approximation reads 

^gauss ^ 1 -i^V+/3H Z can (15) 

E ' £ y/2n(a 2 + e 2 ) " y J 

where H = (H) c ^ n , a 2 = (H 2 ) c p n — H 2 and (3 is the unique positive number satisfying 
(H) c ^ n = E + e 2 f3. Using this same (3 also for the other trace in the expectation values, 
will give the following exact result and an approximation of the Gaussian expectation 
values, 



Tr(p e (ff)e-tf 



(A)f™ s = \ — « (A)f n . (16) 

From this formula we can already get some idea when the approximation will be most 
accurate: the contribution of the additional term %(H) will be small when the variance 
of the Gaussian peak is greater than the variance of the canonical distribution, i.e. 
whenever e 2 > a 2 . We will derive more quantitative bounds for the accuracy of the 
approximation in the next section. 

From the above result we can give a new interpretation of the role of the 
canonical ensemble in quantum statistics: canonical ensemble is an approximation of 
the regularized microcanonical ensemble from which the discrete structure of the energy 
levels has been smoothened out. We have also seen that the canonical approximation 
works best in the limit e/a — ► oo. However, for a fixed energy E a minimum requirement 
for the Gaussian ensemble to give meaningful results is e < E — E , so that taking this 
limit to its extreme is not possible in practice. In the next section we will propose a 
procedure for inspecting how and when a suitable compromise for e can be found. 

Finally, we would like to comment on the interpretation of the canonical entropy, 
S^ n = f3(H) c p n + lnZ§ an , in view of the previous approximation. Since the value of 
Zjf" ss gives the density of the energy eigenstates at energy E — density as the number of 
eigenstates per the energy interval e — its logarithm can be interpreted as the entropy of 
the system and we denote S^^ B = lnZ|?^ ss . On the other hand, from the saddle point 
approximation (O) we conclude that 



Sf™*Sp- lne-1 In 



° 2 

2?r I 1 + — 



^ 2 /5 2 , (17) 



which means that the canonical entropy gives a good approximation of the logarithmic 
density of states per energy e provided the energy resolution is proportional to the 
canonical energy deviation, i.e. e oc a. If the energy resolution is microscopic, then the 
above formula implies a correction to the canonical entropy of the form In N, N being 
the number of particles. Thus a natural interpretation for the canonical entropy is the 
entropy measured from the density of states per energy a, the standard deviation of 
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the canonical ensemble — note that the relation between the density of states and the 
dimensionless number given by the canonical entropy is dubious in any case. 



4. Efficiency and improvements of the canonical ensemble 



In this section we will continue working with the same setup as in the previous one, 
i.e. assume that the system is canonically bounded, A is a canonical observable and the 
parameter (3 is the unique positive solution to the equation (H) c ^ n = E + e 2 f3. Since 
we will be mainly using the canonical expectation values here, we will drop both the 
superscript "can" and the subscript (3 from these expressions; similarly, using the "hats" 
to signify operators will become cumbersome and we will abandon the practise at this 
point. 

Our first aim is to derive quantitative bounds for how well the canonical ensemble 
approximates the Gaussian one and then to derive a method for computing corrections 
to the canonical ensemble when it first begins to fail. The corrections will be expressed 
in terms of the normalized moments of the canonical distribution and for this reason 
we will now adopt the notation a 2 for the variance ((H — (H)) 2 ) and then define the 
normalized Hamiltonian h by the formula 
H-{H) 



h 



a 



If A is positive and non-zero, then Jensen's inequality 
the following bounds valid for any a6l, 



exp 



(Ah 2 )\ (Aexp(-ah 2 )) 



< 



< 1. 



can be used for deriving 



(A) J ~ (A) 

If we apply this to (Ae~ ah2 exp(ah 2 )) / (Ae~ ah2 ) , we can improve also the upper bound: 



exp l —a 



(Ah 2 )\ (Aexp(-ah 2 )) 



< 



Since (h 2 ) 



(A) J ~ (A) 
1 by definition, we get for A = 



< exp 



(Ah 2 ) ( (Ah 4 ) 
-a , ,, exp I —a 



(A) 



(Ah 2 ) 



exp(— a) < (exp(— ah 2 )) < exp[— aexp(— a(h ))]. 



(19) 



(20) 



Applying these bounds with a 



2e 2 



relative efficiency of the canonical expectation values 



exp 



(Ah 2 ) 



< 



Is 2 (A) J ~ (A) 
or by using the more accurate equation 



to (|T6D yields the following bounds for the 



(21) 



( ° 



a" 
2? 



(Ah 2 ) 



exp 



a 

'2e* 



(h 4 ) 



< In 



< 



a 



2e 2 



2 r (Ah 2 ) 
1 - ^-tt 1 exp 



(A) 
a 2 (Ah 4 ) 



(22) 



(A) * V 2£ 2 (Ah 2 )^ 
These equations prove, for positive observables A, the earlier alluded statement that in 
the limit e/a — > oo the approximation by the canonical ensemble becomes exact. On the 
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other hand, the bounds also suggest that it is necessary to have at least ^ < | before 
the canonical expectation values give a trustworthy approximation of the Gaussian ones. 

For a fixed E it is not in general possible to take the limit e — > oo, since then 
(3 — > and thus also a — > oo. Besides, it is also necessary to require that e <C E to 
get a meaningful approximation to the microcanonical ensemble from the Gaussian one. 
In practice, the most interesting applications of the ongoing ideas are in the energy 
region in which the canonical ensemble first begins to fail. For this reason, we also 
need an estimate in terms of the canonical quantities for the region where the canonical 
approximation is not reliable. 

Since 

E = ^'^ £2 =^{ 1 -JW)^ (23) 

should be positive, we can now give the following rules for making an identification 
between the canonical and the Gaussian ensemble 

• The value of j^j measures the effectiveness of the canonical ensemble with a given 
(3: the smaller the value, the better the canonical ensemble works and differences 
are expected to arise when it gets to be of the order of one. Note that in the 
thermodynamical limit, we have (3 oc 1/N, (H) oc iV and a 2 oc N and therefore we 
would expect this quantity to vanish as 1/N, thus implying that the Gaussian and 
the canonical ensemble become equivalent in the thermodynamical limit. 

• For those values of (3 with ^ < 1, choosing an e oc a will give the most 
reliable results. The proportionality factor need not be very large, since the 
relative accuracy of the canonical approximation depends on the second power 
of its inverse — for typical observables the accuracy can improve even more quickly 
as can be seen from equation (p2|). In any case, we get from fl2"3|) an absolute upper 
bound for the possible values of the proportionality factor, 




(3a 2 

Let us then derive an approximation for the Gaussian partition function in the 

CT 2 

2e-' 



region where the parameter a = is small. From we get the following identity 



E ' £ ^27r(a 2 + e 2 ) P \ 

where we have extracted the saddle point approximation that was given in (|15|). 
Let us denote the logarithm of the correction to the saddle point approximation by 
f{a) = |ln(l + 2a) + In (e~ ah2 ^. Equation ( pO] ) immediately yields simple bounds for 
this correction term, 

- ln(l + 2a) - a < f(a) < ~ ln(l + 2a) - ae~ a{h4) . 
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Table 1. The first five derivatives needed in the Taylor expansion of the Gaussian 
partition function. In the table, h n refers to the canonical expectation value (h n ). 

k /W(C) + ) 

1 

2 -3 + h 4 

3 6 + 3/i4 — he 

4 -54 + 12/14 - 3hj - 4h 6 + h 8 

5 360 + 60/i 4 - 30hj - 20/j 6 + I0h 4 h 6 + 5h 8 - h w 



Table 2. The first five general derivatives needed for the Taylor residuals of 
the Gaussian partition function. In the table, <?„ refers to the a-dependent ratio 
(h n exp(-a/i 2 ))/(exp(-a/i 2 )). 

k f^(a) 

^ (1+2q)3 _ 2 #2 + 3.92.94 - 36 

4 - irfc - 6 52 + 125 2 2 34 - 3.9I - 4 5256 + 58 

5 IlHV - 2452 + 60 -9254 - 30 5 254 - 20<? 2 5 6 + 103436 + 5g 2 .9 8 - ffio 



Clearly, / is an analytic function on the right half plane and it is infinitely many 
times differentiable from the right at the origin of the real axis. Thus it has a Taylor 
polynomial expansion at the origin, 

K-i k 

= E Tf / W (0 + ) + 0(a K ), for all a > 0, (24) 

fc=0 

although the corresponding full Taylor series need not converge. This situation is the 
same as is often encountered in a perturbation theory: the result is an asymptotic series 
in the perturbed coupling constant. 

Since /(0 + ) = 0, the constant term of the expansion vanishes and, by virtue of the 
saddle point approximation, also the first coefficient is zero, since (h 2 ) = 1. The rest of 
the coefficients of the expansion (p4"|) are then given by the formula 



k 



f k \0 + ) = {-2) k -\k - l)\ + In (e- ah2 



, for k > 1, 

a=0 



the first five of which are computed in table [I]. 

With a given K, the residual term Rk(cl) — denoted by 0(a K ) in ( p4f) — can be 
written as 

Rk{o) = ^f iK) (a), 

where a G [0, a] depends on a. Since all derivatives are continuous, the values of the 
derivatives at + given in table give an estimate for f^ K \a) for sufficiently small 
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Table 3. The first few normalized derivatives — as defined in equation ( |26| ) — needed in 
the Taylor expansion of the Gaussian expectation values. Here g n and A n refer to the a- 
dependent ratios (h n exp(— ah 2 )) / (exp(— ah 2 )) and (h n Aexp(— ah 2 )) / (Aexp(— ah 2 )), 
respectively. 

k G fc (a) 

1 -A 2 + g 2 

2 A 4 - 2A 2 g 2 + 2g 2 - g 4 

3 -A e + 3A 4 g 2 - §A 2 g\ + 6g% + 3A 2 g 4 - 6g 2 g 4 + g& 

4 A 8 - 4A 6 g 2 + \2A 4 g 2 - 2<±A 2 g\ + 24gf - 6^4.94 + 24A 2 g 2gi - 36g 2 g 4 + 6.g| 

-4A 2 g 6 + 8.9236 - 9s 

5 -A 10 + 5A 8 g 2 - 20A 6 g 2 + 60A 4 fff - 120A 2 .g| + 120. 9 f + 10A 6 g 4 - 60A 4 g 2 g 4 

+180A 2 g 2 g 4 - 240 ff | ff4 - 30A 2 g 2 + 90g 2 gl + 10A 4 g 6 - 40A 2 g 2 g 6 + 60glg 6 
-20g 4 g 6 + 5A 2 g 8 - IO5258 + 9io 



a. Therefore, these values can be used for estimating the residual term and thus for 
deciding the best value of K for a given, small, a. 

If a is not small enough, it is possible to use other, less accurate, estimates. Now 
f( K \a) is a polynomial of the ratios g n = (h n exp(— a/i 2 ))/(exp(— ah 2 )), which, on the 
other hand, have the exact bounds 

expf— a ^ fc+2 ) < j^- < exp(a), for all k > 1. 
V h2k ' h2k 

Therefore, it is possible to use these bounds in the known form of the polynomials 
yielding exact bounds for the values of the derivatives. The first five of the polynomials 
are given in table ^| and the rest can be easily computed, if necessary. 

The same Taylor polynomial approximation in a can be made equally well for 
expectation values of a canonical observable A. Let us define 

(Aexp(— ah 2 )) 
9{<1} = (exp(-ah 2 )) ' 

when by (0) the function g(a) equals the Gaussian expectation value of A at energy 
E = (H) — e 2 /3 and resolution e = a/ \[2a. The Taylor expansion of g is 

9(o) H.>(i/e^) + (S^^)). (») 

where the coefficients Gk are normalized derivatives, 

(exp(-ah 2 )) (k) 

Gt(a) = (j 4exp(- a tf)) 9 iah (26) 
and we have given the first few of them in table []. The expansion up to terms of order 
a 2 is therefore given by 



g(a) = (A) + ((l-h 2 )A)a + 



((l-h 2 )A) + ±((h*-(h*))A) 



a 2 + 0(a 3 
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5. Summary of the results 

In the first section we have shown that when e — > the Gaussian ensemble approaches 
the quantum microcanonical ensemble and the Gaussian expectation values pick out the 
nearest microcanonical expectation values. This makes the Gaussian ensemble an easy 
to use approximation of the microcanonical ensemble, although we have also argued 
that the ensemble can be given an independent physical interpretation in certain kinds 
of experiments. 

The main results of this paper, however, consider the opposite limit, where e is 
much larger than a typical distance between consecutive energy levels. We have shown 
how the canonical ensemble forms an accurate approximation of the Gaussian ensemble 
in this limit. On the other hand, since the Gaussian ensemble is a regularization of the 
discrete microcanonical ensemble in this limit, the canonical ensemble can also be given 
an interpretation as an approximation of this regularized microcanonical ensemble. 

If E and e are the Gaussian energy and energy resolution, respectively, then there 
always exists a unique (3 > defined by the equation (H) c ^ n = E + (3e 2 and this (3 gives 
the best inverse temperature for a canonical approximation of the Gaussian ensemble. 
This approximation is best characterized by the parameter a = a 2 /2e 2 , where a 2 is 
the variance of the energy in the canonical ensemble, a 2 = (H 2 ) c ^ n — ((H) c ^ n ) 2 . The 
approximation was shown to work at least in the region where a « 1 and, since it 
was necessary to have a > Pa 2 / (if )^ an , the latter quantity furnishes an indicator of 
whether or not the canonical ensemble can be used for getting reliable information 
about the properties of a closed system. This indicator can also be given in the form 
(3a 2 /{H) c p n = C v /P{H) c p n : where C v is the specific heat at constant volume. 

Quantitatively, we have proven the following formulas for the canonical 
approximation of the Gaussian ensemble 

o- 2 (3 2 

2"-"-"' 4a 2 
(A)^ ss = (A)f» + AA(a;P). (28) 

The correction terms, which become important in the region a ~ 1, can be given 
an asymptotic expansion in terms of the moments of the normalized canonical energy 
operator, h = (H — (H))/a, 

AS(a; (3) = \(h A ) - 3)a 2 + -(6 + 3(/i 4 > - (h 6 ))a 3 + £>(a 4 ), (29) 
2 6 



S|7 S = - Ine - - In (2rr) - - - In (1 + 2a) + AS(a; (3), (27) 



AA(a;(3) = ((l-h 2 )A)a + 



{{l-h 2 )A)+ l -{{h*-(h*))A) 



a 2 + 0(a 3 ). (30) 



The entropy deviation has also the exact bounds 

- ln(l + 2a) - a < AS(a; (3) < \ ln(l + 2a) - a exp(-a(/?, 4 )), (31) 

while for positive observables A we have derived bounds for the logarithmic proportional 
deviation 



a;auss 

{Ah') , „ 4 ^ _ W. 

— a 



{Ah 2 ) 



— exp(— a{h 4 )) 



( A Ye.s . 

< In . tt — < a 

~ {A)f» ~ 



(Ah 2 ) ( (Ah 4 ) 



1_ T4T exp V a (Ah 



(32) 
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All expectation values in the above are in the canonical ensemble unless stated otherwise. 



6. Thermodynamics of systems with exponentially increasing density of 
states 

We will now repeat the analysis done in sections [| and |ona system with exponentially 
increasing density of states. In this case, the canonical ensemble can be defined only 
up to a certain value of the inverse temperature (3 and it is not clear when and if the 
canonical ensemble will give meaningful results. However, since there are physically 
interesting systems which exhibit an exponential increase of the density of states, e.g. 
free bosonic string theory ||10|| , and since computations in the canonical ensemble are 
typically easier to perform than microcanonical ones, it is useful to know if the canonical 
ensemble can be applied to analysis of such a system. 

We first define what is meant by an exponential increase of the density of states: if 
there exists a finite number 

/? c = inf{/3>0|tre- /3H <cx)}, 

then (3 C can be identified with the speed of exponential increase of the density of states 
as clearly tre - ^ < oo for all (3 > (3 C and tre - ^ = oo for (3 < (3 C . Also, in the following 
we will consider only observables A with the property tr |^4|e — < oo for all (3 > (3 C . 
Since we have assumed that [3 C < oo, the system is Gaussianly bounded and A is a 
Gaussian observable. Note also that (3 C = corresponds precisely to the canonical case. 

Under these assumptions, everything said in the beginning of section [| especially 
formulas (|7|) (pTDj) , still hold if we only require (3 > j3 c instead of (3 > 0. Similarly, the 
analyticity of the integrand in (|10D is guaranteed only in the half plane Kew > (3 C . In 
the saddle point approximation of this integral, the uniqueness of the positive saddle 
point still holds with the same proof as before, but the existence depends crucially on 
the behaviour of the canonical ensemble at temperature 1/(3 C or, more specifically, on 
the value of E c = lim^^i? )^ an . 

If E c = oo, then there exists for all E e R and e > a unique (3 > (3 C for which 
(H) c p n = E + (3e 2 . If E c < oo, then there is a positive saddle point (3 if and only if 
E < E c — (3 c e 2 . In these two cases everything said in section [| will hold for the saddle 
point value of (3 and thus it is reasonable to use canonical ensemble for those systems 
with C v /((3H)f n < 1. 

The situation is different for those values of E, for which E > E max = E c — f3 c e 2 . 
Then there are no saddle points on the positive real axis and the best one can do with 
the canonical ensemble is to choose (3 —>■ (3 C . This approximation, however, is not good 
unless E « -E max as can be seen from the relation 

, Tr(Ap £ {E + (3 c e 2 )e-^ H ) (Ap £ (E c + E- £U)>^ 



E ' e Tr(p £ (E + f3 c e 2 )e-^ H ) ~ (p £ (E c + E - E max )) 
which is a consequence of equation (|7|). 



can 
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Thus we have found that if {H) c ^ = oo the system can always be approximated by 
the canonical ensemble in the thermodynamical limit just as has been explained in the 
preceding sections. In effect, as the system is "heated up" by adding more and more 
energy to it, the temperature of the system will increase asymptotically to the limiting 
value T c = 1/ (3 C . On the other hand, if (H) c ^ < oo, then there exists a limiting energy 
-Emax ~ (H) c ^ after which the canonical ensemble should not be used for estimating the 
statistical behaviour of the system but using some form of the microcanonical ensemble 
instead would be advisable. 



7. Discussion 



In this paper we have examined relations between the microcanonical and the canonical 
approach to quantum statistics. On the level of ideas, the present results are well-known 
and well-presented in most standard textbooks on statistical physics and, naturally, none 
of our results tells anything new about systems fully in the thermodynamical limit. 
What we have aspired to do here is to develop a systematic treatment of systems, which 
are neither large enough to be considered thermodynamical nor simple enough to be 
completely solvable, but which are, nevertheless, in an energetic equilibrium with their 
environment. 

By taking the energy fluctuations of the system as an integral part of the 
microcanonical formalism we have been able to show rigorously how the canonical 
ensemble gives an approximation of the microcanonical statistics even for systems which 
are not even near a thermodynamical limit and we have been able to give precise relations 
between canonical concepts, such as temperature, and the more fundamental concepts 
related directly to energy. We have also shown how and when the canonical ensemble 
can be stretched to aid in the analysis of these non-thermal systems. 

In our analysis of regularization of the energy spectrum we have limited ourselves 
to the Gaussian distribution. This, however, is only a convenient choice and the analysis 
could be repeated by using any smooth function with a compact support instead. 
Using this second alternative would, in fact, be necessary for microcanonically bounded 
systems with the logarithm of the density of states increasing faster than quadratically 
in energy, but we will not redo this analysis here. Neither have we yet discussed the 
evaluation of the canonical moments of the energy operator, which are required for the 
asymptotic expansion of the Gaussian expectation values — this will be the subject of 
a subsequent work, where we will also show how lattice Monte Carlo methods can be 
employed in evaluation of the Gaussian expectation values. 

For sake of mathematical defmiteness, we have in this paper inspected only quantum 
mechanical systems, but on the level of formal manipulation, the results given here can 
be equally well interpreted as statements about statistical quantum field theory. Since 
canonical quantum field theory is already a well-developed part of physicists' toolkit 
PH , this is not as bold a claim as it looks at first sight. In fact, the formulation of 



quantum microcanonical ensemble given in || can also be obtained from the e, (3 — > + 
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limit of the Gaussian formula ( JlQ| ) , which might then lead to a more rigorous derivation 
of that formulation after the intricacies in the definition of four-dimensional statistical 
quantum field theories have been resolved. 
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